Norm Based Causal Reasoning in Textual Corpus

Author

  • Farid Nouioua
Abstract

Truth-based entailments are not sufficient for a good comprehension of natural language (NL). In fact, they cannot deduce the implicit information necessary to understand a text. Norm-based entailments, on the other hand, are able to reach this goal. Consider this text [1][2]: "the vehicle in front of me braked". Using a truth-based approach, we can obtain all the logical consequences of a formula such as: (∃v, t) Vehicle(v) ∧ Instant(t) ∧ In-Front-Of(v, me, t) ∧ Brake(v, t). Norms provide further conclusions, such as: v and me were travelling in the same direction, no vehicle was between v and me, I had to brake when v braked, and so on. This idea was behind the development of Frames [3] and Scripts [6][5] in the 1970s. But these theories are not formalized enough, and their adaptation to new situations is far from obvious. Actually, no repository of norms is available for a given domain. Moreover, norms are seldom made explicit in texts because, as Schank noticed, texts do not describe the normal course of events but focus rather on the description of abnormal situations. The motivation of the present work is to extract norms by detecting their violations in texts. We are working on a corpus of 60 texts describing car crashes. For each text, we search for the cause of the accident as perceived by a standard reader. We hypothesize that the perceived cause of an abnormal event is the violation of a norm (an anomaly). Among all the anomalies evoked by a text, one is considered 'primary': it represents the most plausible cause of the accident. The other anomalies result from the primary one and are called derived anomalies.
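The distinction between primary and derived anomalies can be illustrated with a minimal Python sketch. It is not the author's system: the `Norm` class, the `explained_by` field, the norm names, and the three example norms are all hypothetical, and the sketch simply assumes that an anomaly is derived when another violated norm is declared to explain it.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Norm:
    """A hypothetical norm: something expected in the normal course of events."""
    name: str
    # Names of norms whose violation would explain this norm's violation
    # (an assumption of this sketch, not a structure given in the abstract).
    explained_by: tuple = ()


# Illustrative norms for a rear-end collision; invented for this example.
NORMS = [
    Norm("keep_safe_distance"),
    Norm("stop_in_time", explained_by=("keep_safe_distance",)),
    Norm("avoid_collision", explained_by=("stop_in_time",)),
]


def find_anomalies(norms, violated_names):
    """Return the norms that the text reports (or implies) as violated."""
    return [n for n in norms if n.name in violated_names]


def primary_anomalies(anomalies):
    """Keep only anomalies that no other detected anomaly explains."""
    violated = {a.name for a in anomalies}
    return [a for a in anomalies
            if not any(cause in violated for cause in a.explained_by)]


# Violations a reader might infer from a crash report (hypothetical input).
violated = {"keep_safe_distance", "stop_in_time", "avoid_collision"}
anomalies = find_anomalies(NORMS, violated)
primary = primary_anomalies(anomalies)
derived = [a for a in anomalies if a not in primary]
```

Under these assumptions, `primary` holds the single norm that nothing else explains, and `derived` holds the anomalies that follow from it.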


Similar resources

Detecting and Explaining Causes From Text For a Time Series Event

Explaining underlying causes or effects about events is a challenging but valuable task. We define a novel problem of generating explanations of a time series event by (1) searching cause and effect relationships of the time series with textual data and (2) constructing a connecting chain between them to generate an explanation. To detect causal features from text, we propose a novel method bas...


Analysing Air Incident Reports: Workshop Challenge

To promote discussion at the Fourth Workshop on Textual Case-Based Reasoning (TCBR), we set a workshop challenge. We encouraged all potential workshop participants, individually or in their research groups, to submit a short paper that addressed this challenge. The purpose of this paper is to explain the challenge. We challenged participants to do the following: analyse the corpus of Air Invest...


Discovering Causal Relations in Textual Instructions

One aspect of ontology learning methods is the discovery of relations in textual data. One kind of such relations are causal relations. Our aim is to discover causations described in texts such as recipes and manuals. There is a lot of research on causal relations discovery that is based on grammatical patterns. These patterns are, however, rarely discovered in textual instructions (such as rec...


A Corpus-Based Study of the Lexical Make-up of Applied Linguistics Article Abstracts

This paper reports results from a corpus-based study that explored the frequency of words in the abstracts of applied linguistics journal articles. The abstracts of major articles in leading applied linguistics journals, published since 2005 up to November 2001 were analyzed using software modules from the Compleat Lexical Tutor. The output includes a list of the most frequent content words, list...


Case-based Reasoning for Diagnosis of Stress using Enhanced Cosine and Fuzzy Similarity

Intelligent analysis of heterogeneous data and information sources for efficient decision support presents an interesting yet challenging task in clinical environments. This is particularly the case in stress medicine where digital patient records are becoming popular which contain not only lengthy time series measurements but also unstructured textual documents expressed in form of natural lan...



Journal:
  • CoRR

Volume: abs/cs/0610016

Publication date: 2006